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(57) Abstract: A system and method for metaboUc profiling for different species. The system comprises data from a wide variety of 
laboratory sensors and other measurements combined with an algorithmic database allowing multi-variate analysis of the data to be 
performed. Protocol information is stored concerning the data resident within the database. Data is retrieved throughout the growing 
season, across growing seasons, and for a wide variety of response variables, and for different species, thus allowing a wide variety 
of analysis across species and within species to take place. Data is stored concerning the workflow of any particular analysis so that 
scientists can build upon successful analyses to create further novel methods of analyzing metabolic data. 
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1 Title: System and Method for Metabolic Profiling 

2 

3 

4 

5 Field of the Invention: 

6 This invention relates generally to determining characteristics for healthy or 

7 beneficial agricultural products. More particularly, the present invention is a system and 

8 method for creating and analyzing metabolic profiles of plants for purpose of optimizing 

9 present and future agricultural products. 
1 0 Background of the Invention: 

^ ^ ^ Pla^t species, genetic and chemical variation for a new growth cycle of the 

12 species can lead to a good or a poor harvest. Further, various diseases and environmental 

13 conditions can affect the yield of a particular crop sometimes with very beneficial and 
other times very disastrous results. For example, in the early seventies a com blight 
manifested itself throughout the Midwestem part of the United States. This com blight 

16 wiped out approximately 25% of the com harvest yielding many billions of dollars of 

17 agricultural damage not to mention a ripple effect on prices for com products as well as 

18 products that relied upon com for feed purposes. The end result was billions of dollars of 
i 9 damage throughout the economy of the United States. 

This led to a great deal of research on such things as environmental conditions, 

2 1 amount of fertilizer, and method of detection at an early stage when beneficial or 

22 detrimental circumstances exist with a particular crop. These various activities intended 

23 to focus on individual environmental factors as well as other man made factors such as 

24 amount of fertilizer applied and the like. The response variable for these individual 

25 studies has tended to be the amount of crop harvested. While this is certainly a useful 

26 "bottom line" figure, it does little to evaluate a trend in a crop to be productive or not. 

27 Yet there are many aspects throughout the growth cycle of plants that can be analyzed and 

28 correlated with good or bad harvest. For example, the near infrared spectmm of a crop 

29 and the absorption in the chlorophyll absorption band tend to change early when stress of 

30 one sort of another occurs in a given crop. This indicator can be detected early. Further, 
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1 various metabolic process occur during the growth cycle of a plant. The result of these 

2 metabolic processes are a series of "metabolites" which are present in the particular plant 

3 as a result of the various growth cycle functions. These metabolites will differ from plant 

4 to plant and will also differ during the course of the growth cycle of the plant. Using 

5 various technologies a wide variety of indicators of the metabolic activity of a plant 

6 species can be measured. For example, and without limitation, gas chromatography, high 

7 performance liquid chromatography, liquid chromatography mass spectrometry, ultra 

8 violet through the visible range of spectrophotometry both narrow band and wide band, 

9 thin layer chromatography, and infrared spectrometry in the near infrared midwave 

10 infrared and Far IR are all candidates technologies for measurement of the metabolights of 

11 various plant species. 

12 In addidon to these precise laboratory measurements, data can be recorded 

13 concerning weather, soil, plant physiology, geography, hydrology, genotype, gene 

14 expression, primary metabolites produced, processes metabolites can all be measured and 

15 stored for subsequent analysis. 

16 While the above addresses issues of various plant varieties during the growing 

17 season there is also a significant post harvest issue that resuhs when subsequent chemical 

18 processes, both natural and man made, continue after a plant species is harvested. For 

19 example, the ripening of fruits of all different kinds is a process that occurs frequently 

20 after the particular fruit in questions has been harvested. Knowing when to harvest, 

21 bananas, apples, peaches, and grapes to name but a few and, given the known time to 

22 market, can yield a better product on the shelves. The ripening process is a continuing 

23 chemical and metabolic process that condnues in plants after they are harvested. The 

24 same holds true for various beverages that result from brewing and fermentation process 

25 such as beer and wine. Determining when the appropriate metabolic characteristics of a 

26 successfiil flavor exists can be critical to the generation of revenue and the building of the 

27 reputation of a particular organization for producing that particular product, 

28 Further, each different type of agricultural crop has some similar measures, in the 

29 case of environmental conditions such as humidity, rain fall, days of rain in a given 

30 period, days of drought in a given period, and the like. In addition however, each crop 
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1 also has its own measurement characteristics and terms associated with the crop in 

2 question. Taste in a particular crop such as wine, fruit, and edible crops are very 

3 important. However, other types of crops, which are used for widely different purposes 

4 such as wood products, rely on totally different measurements. Thus different amounts of 

5 the same metabolites among species of plants do not necessarily symbolize a positive or 

6 negative trend in any of the species individually. 

7 The monitoring of genetic expression also cannot be over emphasized. The fact 

8 that certain genes mutate in small or large ways also does not mean that the result in 

9 harvest will be bad. Indeed a gene mutation may led to a beneficial development in a 

10 particular crop. Once such gene mutations are uncovered and are determined to be 

1 1 beneficial, such genetic information, which results in enhanced traits of interest, maybe 

12 desired to be replicated to create entire crops comprising such mutated genes. Such crops 

13 could then be used for their normal commercial purposes or in the pharmaceutical field for 

14 example. 

15 Given the above situation, wherein a variety of historical measurement variables 

16 exist and have been recorded and will be recorded in the future, where environmental 

17 information for a given crop also exists and can be measured in the future, where new 

18 laboratory techniques exist for measuring a variety of response variables which 

19 historically have no parallel, and given the fact that all of these different types of variables 

20 are in different quantities for different species of plants and indeed may exist for one 

21 species of plant and not for another, one has an analytical environment with unlimited and 

22 uncorrelated measurement combinations. Indeed, only some of these various response 

23 variables have only been used in a simplistic fashion to roughly correlate quality of crop 

24 with a single particular variable. Yet there is no comprehensive integrated way of 

25 analyzing all of these variables to determine how they co-vary and what the significance 

26 of that co-variance is. 

27 What would be truly useful is a system and method that can be universally applied 

28 on a species by species basis to measure a wide variety of variables, determine how those 

29 variable co-vary with one an other and what the significance of that co-variance truly is. 

30 Such a system would utilize past historical measurement variables to provide appropriate 
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1 context for professionals operating in the particular agricultural arena in question, would 

2 allow for the input of new response variables as laboratory techniques become 

3 increasingly sophisticated, and would penmit correlation analysis of these various types of 

4 environmental, laboratory, and historical types of measurement to determine in an 

5 efficacious way how best to manage and harvest a wide variety of agricultural crops and 

6 products. 

7 What would further be usefiil is a transparent user friendly system which integrates 

8 data from various data sources which vary by data type and vary with species being 

9 analyzed. Such a system would also comprise a variety of analysis tools for multi-variate 

10 analysis, modeling, simulation, and visualization of resultant data. The system would also 

1 1 be able to take the results of successful research and store the steps used to obtain that 

12 successful research so that the scientific inferences and steps to achieve them are available 

13 for others to build upon. 

14 Summary of the Invention: 

15 It is therefore an objective of the present invention to record data, analyze a variety 

16 of data types, and predict the direction, either efficacious or not, for a particular crop 

17 during the growing season, 

1^ It is yet another objective of the present invention to analyze and predict, post 

19 harvest, the direction, efficacious or not, for agricultural products that are being stored. 

20 It is a fiirther objective of the present invention to be able to predict the direction, 

21 efficacious or not, for agricultural products that are the subject of post harvest processing. 

22 It is yet another objective to create a system and method for storing, acquiring, and 

23 displaying all maimer of metabolic data. 

24 It is yet another objective of the present invention to store and correlate the 

25 experimental protocol used to obtain various types of metabolic data. 

26 It is yet another objective of the present invention to store associated 

27 environmental and developmental data for a particular species. 

28 It is a further objective of the present invention to be able to compare and analyze 

29 metabolic data for the same species under a variety of environmental conditions. 
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1 It is further objective of the present invention to be able to siore and analyze 

2 metabolic data for the given species during various developmental stages in a growing 

3 season. 

4 It is a funher objective of the present invention to analyze data on the same species 

5 across various growing seasons at similar developmental stages. 

6 It is yet another objective of the present invention to be able to analyze genetic 

7 mutations of the same species under the same environmental conditions. 

8 It is a further objective of the present invention to analyze the metabolic profile 

9 variation of different species under the same environmental conditions. 

It is yet another objective of the present invention to perform a muld-variate 

1 1 analysis on a wide variety of environmentai, metabolic, and genetic information of a 

12 particular species to determine optimal growth conditions so that a given species will 

13 display an optimal set of traits. 

It is yet another objective of the present invention to perform a multi-variate 

15 evaluation of metabolic, genetic, en\droirmenLal condidons, to determine the optimal time 

16 to harvest the particular species in question. 

It is yet another objective of the present invention to provide for the multi-variate 

18 analysis of metabolic, environmental, and genetic information of a particular species to 

19 characterize and optimize mutant strains for a panicular species. 

It is a further objective of the present invention to be able to browse and quer>' data 
21 of various types representing various species in a user friendly fashion. 
2^ I^ is yet another objective of the present invention to be able to integrate different 

23 data regardless of database schemata, semantics, or syntax. 

It is a further objective of the present invention to allow scientists to visualize 

25 information via a single system without having to download multiple tools from niuhiple 

26 locations and to convert files from one data type to another. 

It is yet another objective of the present invention to allow scientists to have 

28 analysis algorithms tightly integrated v/ith database resources so that multi-variate 

29 analysis tools are readily a\'ailable. 
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It is yet another objective of the present invention to allow scientists to define 
workflows which can be executed repetitively on large batches of data from one growing 
season to another. 

It is a further objective of the present invention to allow scientists to store 
discoveries or inferences of metabolic data so that they can be used to build upon for 
further research. 

These and other objectives of the present invention will be apparent to those 
skilled in the art from a review of the specification that follows. 

The present invention is a method and apparatus for multi-variate metabolic 
profiling that provides for a correlation between metabolic, genetic, and environmental 
factors that effect the growth of a particular species. The present invention allows a 
disciplined approach to the analysis and determination of optimal set of traits that 
characterize a successfol crop. Conversely, the present invention allows for multi-variate 
analysis to determine when the particular crop is trending toward a detrimental harvest so 
that intervention can occur at an early stage thereby preventing economic hardship and 
reduced productivity. 

The present invention comprises a database and processing capability for accepting 
a wide variety of input from field readings, environmental readings, and laboratory 
readings for a particular species and for analyzing those various readings in a multi-variate 
fashion to make deterministic analysis for the species in question. 

For each variable being measured, raw data on the variable is stored together with 
the sampling protocol used to obtain the data. Further, data is stored based upon the 
species from which the data is collected. Thus for each species, raw data and a wide 
variety of field, environmental, and laboratory types will be stored together with the 
protocol used to collect each of the different types of data. This information is also stored 
according to the species about which the data is collected In this fashion, a multi 
dimensional data base is created. 

Whole-organism metabolic profile data is also obtained and stored for a specific 
species. For example, and without limitation, gas chromatography - mass spectrometry 
(GC-MS), high performance liquid chromatography (HPLC), high performance liquid 
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chromatography-mass spectrometry (LCMS), thin layer chromatography (TLC), ultra 
violet and visible spectrophotometry (UV-VIS), short wavelength, mid wavelength, and 
long wavelength infrared (SWIR, MWIR, LWIR), raman spectrometry, and biosensor 
information are all collected and stored within the data base of the present invention. This 
information is collected for any given species desired, and is collected during various 
stages of the growing season, and over longer periods of time, on a season by season basis 
for the same variables during the same periods of respective growing seasons. As noted 
above the protocol for the collection of each type of this information is stored and 
associated with the data being stored. 

A separate portion of the data base stores environmental information such as 
temperature, pressures, humidity, concentrations of exogenous (added) compounds such 
as fertilizer and other physical and chemical properties which may controlled by the crop 
owner or the experimentalist. 

A library of various algorithmic approaches is also stored together with association 
of which multi variate analysis technique is best for a particular species, response variable, 
or other characteristic. Thus multi-variate analysis and/or principal axes factor analysis or 
other multi variate analysis algorithms are stored in an algorithmic data base and 
associated with the various species and response variables which are best used in 
conjunction with an algorithmic analysis. 

The system further provides an automated method for data retrieval and analysis as 
well as numeric and visual display of data to optimize the human factors interaction with 
the very complex data base of the present invention. 

Thus if, for example, a particular analyst wishes to review the trend for an orange 
crop the analyst would select the crop to determine the end product desired. For example, 
for vitamin production or orange juice production, the analyst would input the desired end 
product, and information from the multi-dimensional data base will be automatically 
retrieved, based upon those variables which are most predictive in nature, the appropriate 
analysis algorithm will be selected, the data will analyzed, and an appropriate output will 
be created for the analyst noting, among other things, the direction for the crop, 
(efficacious or not) whether intervention steps must be taken, what those steps should be, 
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1 and what the protocol would be for future analysis to monitor the crop in question. In 

2 addition, the system of the present invention has the ability to generate a sampling plan for 

3 a researcher or farm owner who wishes to generate information about the yield of a crop 

4 over time and/or during a growing season. 

5 As noted earlier, this not simply an academic exercise. This type of analysis is 

6 envisioned for various points in the growing season and would be accessed by individual 

7 farmers to determine the direction for their specific crops as well as for others in the 

8 government or in private investment industry that wish to determine the future prospects 

9 for a crop in question. Thus the economic potential for the present invention is 
10 significant. 

^ ^ The present invention is implemented on a Sun Microsystem server which runs the 

12 data base, analysis algorithms, server software and four connecting to a work stations. 

13 The work stations are connected over a network which may be a local area network 

14 (LAN), a wide area network (WAN), and/or work stations connected to the Intemet. 

15 These network connections are but one example of the type of network connection and are 

16 not meant as a limitation. For example, workstations may be connected in a wireless 

17 fashion to the server of the present invention simply by means of a transceiver located at 

18 the workstation location and at the server. Alternatively connections exist between remote 

19 work stations operating wirelessly to Intemet service providers and thence to the Intemet 

20 for connection to the server of the present invention, 

2^ Analysis algorithms that are stored in the server and used for the present invention 

22 are, for example, and without Hmitation, multi-variate statistics and artificial intelligence 

23 algorithms such as clustering algorithms, multi-variate factor analysis, principal axes 

24 factor analysis, and other types of multi-variate algorithms which are capable of being 

25 exercised by the server of the present invention. In addition curv^e fitting algorithms of 

26 various types known in the art are stored and available to the analyst as are various patent 

27 recognition algorithms, flux analysis together with metabolic control analysis and various 

28 visualization options for display of data. This information is integrated together with 

29 proteomics and gene expression data bases to allow correlations with these types of data 

30 as well. 
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1 It is also the situation that measurement of certain of the response variables that 

2 are of an historical nature and those of a laboratory nature follow different protocols for 

3 their collection. For example, collecting information on the sugar content of grapes may 

4 take place at one frequency during the course of the growing season while information on 

5 genetic expression may take place at an entirely different frequency during the growing 

6 season for the same crop. Conversely, crops that are harvested only once every tens of 

7 years, such as wood and paper product type crops may have sampling frequencies that are 

8 radically different than those crops that are grown and harvested within a single season. 

9 Brief Description Of The Drawings 

10 Figure 1 illustrates response variable types at a particular time. 

1 1 Figure 2 illustrates relationships between response variable and collection 

12 protocols. 

13 Figure 3 illustrates the conceptual database structure. 

14 Figure 4 illustrates the overall system architecture. 

1 5 Detailed Description Of The Invention 

16 Referring to Figure 1, the conceptual framework for response variables is 

17 illustrated. Response variables 100 comprise environmental data 102, laboratory data 104, 

18 metabolic data 106, and genetic data 108. Environmental data 102 may comprise several 

19 types of environmental data 1 10, 1 12, and 114 which may be rainfall, humidity, 

20 temperature, and indeed any other type of environmental information that may be 

21 important to the growth cycle of a particular species under analysis. 

22 Laboratory data 104 comprises multiple types of data as well, herein illustrated as 

23 type 1 116, type 2 118, type 3 120, and type 4 122. 

24 Various types of metabolites are also recorded in metabolic data 106. Here 

25 metabolites 124, 126, 128, and 130 are recorded with respect to their presence as well as 

26 their concentrations. 

27 Finally genetic data 108 is also recorded as a series of observations with the 

28 presence of various mutant genes for a particular species. Thus, generic data 132, 134, 

29 136, and 1 38 are observed and their presence recorded. 
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1 It should be noted that all of these response variables 100 are recorded at a 

2 particular time during the growing cycle Tl . 

3 Referring to Figure 2, collection information in the form of data protocols is 

4 conceptually illustrated. Response variable data 1 00, which comprises response variables 

5 102, 104, and 108 are all associated with a data protocol information 160 which comprises 

6 data protocols 162, 164, and 166. These data protocols are each associated with the 

7 response variables so that an analyst/researcher can determine how a particular response 

8 variable was derived and what the various data sampling schemes were that were 

9 associated with each particular response variable. 

10 Referring to Figure 3, the conceptual database structure is illustrated. Response 

1 1 variables 1 00 are recorded for a particular time Tl during the growing season (GS 1 ), 1 80. 

12 The same response variables are also recorded for times T2 170 and TN 172 which 

13 represent various times for sampling during growing season number 1, 180. This same 

14 type of response variable sampling during different times in a growing season is also 

15 recorded for growing season 2, 1 82 and growing season N 1 84. 

16 All of this information regarding response variables 100 recorded at different times 

17 for different growing seasons are all recorded for a particular species 1 90. This sampling 

1 8 and recording of data is also done for additional species 1 92 and 1 94, Thus a particular 

19 response variable may be analyzed across species, across growing seasons, and across 

20 different sampling times during a particular growing season. All of this information may 

21 be parsed and analyzed in a multi-variate way. It should be noted that the specific number 

22 of growing seasons, species, response variables, and sampling times are all illustrative in 

23 nature and are not meant to be limiting. In practice, there will be many samphng times 

24 during a particular growing season, and many species of commercial value may be 

25 analyzed in the fashion noted in the present invention. 

26 Referring to Figure 4, the overall system architecture is illustrated. Metabolic 

27 processor 200 has a series of supporting databases. Raw data from various environmental, 

28 laboratory, and other measurements are stored for the data types and sampling first noted 

29 in the conceptual database structure (Figure 3). These data are called upon for the various 

30 analysis desired by the scientists. 
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1 A protocol database 204 is also stored and is related to the various data stored in 

2 the raw database 202. In this fashion, an analyst can analyze any single piece of data and 

3 determine the protocol that was used to obtain it. 

4 Types of response variables are stored in a response variable database 206 which 

5 allows an analyst to determine what types of data may be resonant in the database and 

6 what types of data could be obtained in order to support any analysis task. 

7 An algorithmic database 208 is available to the analyst for subsequent loading on 

8 the metabolic processor 200. This provides the analyst with a wide variety of multi- 

9 variate analyses such as, and without limitation, clustering algorithms, flux analysis, 

10 metabolic control analysis, multi -variate factor analysis, principle axes factor analysis, 

1 1 curve fitting, pattern-recognition, and similar tools. All of these tools are stored in 

12 algorithmic database 208 and can be loaded on metabolic processor 200 to serve as the 

13 basis for analyzing raw data 202 concerning any particular species or trends of data within 

14 the species. 

15 Once a particular algorithm in combination with certain response variables and 

16 raw data are determined to be useful, that specific analysis is stored in an analysis 

17 database 220. By storing the appropriate analysis steps, a subsequent scientist can access 

18 metabolic processor 200 via workstations 210, 212 and request a specific analysis for a 

19 specific species* This algorithm will be then retrieved from the analysis database 220 

20 which will automatically cause the appropriate raw data to be retrieved and analysis 

2 1 results to be output to the researcher at the workstation. 

22 A series of workstations are connected to the metabolic processor 200. As 

23 illustrated, workstations 210 and 2 12 are connected via a local area network to the 

24 metabolic processor 200. 

25 Metabolic processor 200 can also be accessed over a network 214 by remote 

26 v/orkstations 2 1 6 and 218. Examples of such a network can be an intranet, the internet, or 

27 any other network suitable for providing remote access to a central processor. 

28 As noted earlier, the present system is implemented on a sun microsystems server 

29 for running the database, analysis algorithms, and server software. This will allow any 

30 computer with the web browser to act as a client for the metabolic processor 200. 
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1 Generally any type of workstation such as an IBM PC or compatible running, for example, 

2 a Pentium processor having local storage, and output capability will be suitable for a client 

3 station for the system. 

4 Various technologies will serve as the basis for collecting raw data of a laboratory 

5 nature concerning species of interest. For example, and without limitation, gas 

6 chromatography, mass spectrometry, HPLC, LCMS, TLC, UV-VIS, SWIR, MWIR, 

7 LWIR, raman spectrometry, and various bio-sensor information can all be collected and 

8 tagged with the appropriate protocol used for the collection and associated with a 

9 particular species and timed during the growing season during which the samples were 
10 taken. 

1 ^ Using the metabolic information recorded in the database structure, and using the 

12 system of the present invention, qualitative studies can be accomplished to determine 

13 which metabolites are expressed and potentially discover novel compounds which are 

14 indicative of the quality of a particular harvest. 

15 In addition, quantitative analysis can be conducted to measure concentrations of 

16 metabolites during the course of the growing season in order to determine a trend for the 

17 particular crop in question. In this fashion, it will ultimately be possible to create 

18 predictive models to assist in optimizing any particular crop for desired characteristics of 

19 yield and quality. Further, by bringing together these disparate data types, it will be 

20 possible for a scientist or analyst to be able to evaluate in a streamline fashion data that 

21 heretofore, has not been able to be combined in any meaningful fashion within a database. 

22 With knowledge of how to conduct a specific type of study, an analyst can simply 

23 input a desired end product for an analysis, such as, for example and without limitation, 

24 how much orange juice can a particular crop produce? The system can then select the 

25 appropriate analysis algorithm, pull the data from the database, and create the predictive 

26 model or response. If, on the other hand, vitamin or supplement products are desired from 

27 the orange crop, a different model may be run, using perhaps a different predictive 

28 algorithm from the database. 
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1 In the alternative, a grower can ask the system what type of data and sampling 

2 rates are required if that grower is to make a prediction for an optimized amount of 

3 product from a given crop. 

4 A system and method for metabolic profiling has been described. It will be 

5 apparent to those skilled in the art that other types of data can be brought into the system 

6 for analysis, the types of analysis tools can be stored in the analysis database for use by 

7 scientists, other types of protocols for obtaining different types of data may also be created 

8 and stored for later access by the scientist without departing from the scope of the 

9 invention as disclosed. 
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1 I claim: 

2 1 . An apparatus for metabolic profiling comprising: 

3 a processor; 

4 a database connected to the processor for storing data concerning a 

5 plurality of plant species; 

6 at least one workstation for accessing the processor; and 

7 a database comprising metabolic data concerning the plurality of plant 

8 species. 

9 2. The system for metabolic profiling of claim 1 wherein the database further 

10 comprises an algorithmic database comprising computer programs for multi-variate 
1 i analysis of the metabolic data. 

12 3. The system for metabolic profiling of claim 2 wherein the computer programs 

13 for multi-variate analysis are taken fi-om the group consisting of clustering algorithms^ 

14 flux analysis, metabolic control analysis, system identification, multi-variate factor 

15 analysis, principle axes factor analysis, pattern-recognition, and curve fitting programs, 

16 4. The system of metabolic profiling of claim 1 when the database further 

17 comprises protocol data correlating metabolic data and how metabolic data was obtained. 

18 5. The system for metabolic profiling of claim 1 wherein the database fiirther 

19 comprises workflow routines for established analysis techniques. 

20 6. The system for metabolic profiling according to claim 1 wherein the database 

21 fiirther comprises environmental data. 

22 7. The system for metabolic profiling according to claim 1 wherein the database 

23 further comprises species developmental data. 

24 8. The system for metabolic profiling according to claim 6 wherein the processor 

25 fiirther comprises instructions for analyzing the same species as a fimction of the 

26 environmental data. 

27 9, The system for metabolic profiling according to claim 7 wherein the processor 

28 fiirther comprises instructions for analyzing the same species as a function of the 

29 developmental data. 
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10. The system for metabolic profiling according to claim 6 wherein the processor 
further comprises instructions for analyzing different species as a function of the 
environmental data. 

1 1 . The system for metabolic profiling according to claim 1 wherein the metabolic 
data comprises data from analysis of plant species using laboratory sensors. 

12. The system for metabolic profiling according to claim 1 1 wherein the 
laboratory techniques comprise techniques from the group consisting of GS-MS, HPLC, 
LCMS, TLC, UV-VIS, SWIR, MWIR, LWIR, raman spectrometry, and biological 
sensing. 
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^ (57) Abstract: A system and method for metabolic profiling for different species. Tiie system comprises data from a wide variety of 
^ laboratory sensors and other measurements combined with an algorithmic database allowing multi-variate analysis of the data to be 
performed. Protocol information is stored concerning the data resident witbin the database. Data is retrieved throughout the growing 
^ season, across growing seasons, and for a wide variety of response variables, and for different species, thus aUowing a wide variety 
^ of analysis across species and within species to take place. Data is stored concerning the woricflow of any particular analysis so that 
scientists can bmld upon successful analyses to create further novel methods of analyzing metaboUc data. 
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